Improving the lexical coverage of English compound adjectives in syntactic parsing
نویسنده
چکیده
The present paper addresses the question how in syntactic parsing the coverage of words in previously unseen text may be improved. The adjectives in English are presented here as a case study. Working on the assumption that most new words that are introduced into the language are constructed on the basis of already existing words through the application of word-formation processes, we investigate the role that different word-formation processes play, more specifically in the formation of adjectives in English. An analysis of adjectives in the BNC shows that in the case of adjectives compounding is the word-formation process that is most productive. Moreover, compound adjectives are not formed by combining bases at will; rather, a limited set of fairly simple rules apply that restrict the co-occurrence of bases. This makes it feasible to develop an approach for handling compound adjectives which is rather effective, as is evident from the results from a first implementation where of a set of 30,561 compound adjectives derived from the BNC, 88.68% were correctly identified Proceedings of the 18th Meeting of Computational Linguistics in the Netherlands, pp. 117–130 Edited by: Suzan Verberne, Hans van Halteren, Peter-Arno Coppen. Copyright c ©2008 by the authors. Contact: [email protected]
منابع مشابه
Improving the lexical coverage of English compound adjectives
The present paper addresses the question how in syntactic parsing the coverage of words in previously unseen text may be improved. The adjectives in English are presented here as a case study. Working on the assumption that most new words that are introduced into the language are constructed on the basis of already existing words through the application of word-formation processes, we investiga...
متن کاملThe Relationship between Syntactic and Lexical Complexity in Speech Monologues of EFL Learners
: This study aims to explore the relationship between syntactic and lexical complexity and also the relationship between different aspects of lexical complexity. To this end, speech monologs of 35 Iranian high-intermediate learners of English on three different tasks (i.e. argumentation, description, and narration) were analyzed for correlations between one measure of sy...
متن کاملA Contrastive Analysis of Sports Headlines in Two English Newspapers
It holds true that a flourishing fieldof Contrastive Rhetoric (CR) research has begun to address theway various text types and/or genres may differ across culturesand languages (Corner, 1996). Very much in line withthis development, this study was an attempt to characterizethe linguistic structures of headlines in the sports section of 2 English newspapers: one non-Iranian (The Times) and one ...
متن کاملThe Effect of Reducing Lexical and Syntactic Complexity of Texts on Reading Comprehension
The present study investigated the effect of different types of text simplification (i.e., reducing the lexical and syntactic complexity of texts) on reading comprehension of English as a Foreign Language learners (EFL). Sixty female intermediate EFL learners from three intact classes in Tabarestan Language Institute in Tehran participated in the study. The intact classes were assigned to three...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کامل